Pengcheng Zhu

WenetSpeech-Wu: Datasets, Benchmarks, and Models for a Unified Chinese Wu Dialect Speech Processing Ecosystem

Jan 16, 2026

VoiceSculptor: Your Voice, Designed By You

Jan 15, 2026

Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis

Sep 26, 2025

REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers

Aug 07, 2025

Finite-Precision Conjugate Gradient Method for Massive MIMO Detection

Apr 14, 2025

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Mar 03, 2025

Performance Analysis of Local Partial MMSE Precoding Based User-Centric Cell-Free Massive MIMO Systems and Deployment Optimization

Oct 08, 2024

M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions

Sep 24, 2024

E1 TTS: Simple and Fast Non-Autoregressive TTS

Sep 14, 2024

MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion

Sep 14, 2024